CDS
Accession Number | TCMCG015C06709 |
gbkey | CDS |
Protein Id | XP_027103815.1 |
Location | join(2499955..2500977,2501427..2501621,2501824..2501988,2502316..2502414,2502573..2502821,2503493..2503744,2503854..2503985,2504426..2504686,2504771..2504842,2505373..2505522,2506033..2506524,2506851..2507198) |
Gene | LOC113725047 |
GeneID | 113725047 |
Organism | Coffea arabica |
Protein
Length | 1145 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA506972 |
db_source | XM_027248014.1 |
Definition | DNA mismatch repair protein MSH3-like isoform X1 [Coffea arabica] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGGGAAGCAGAAGCAACAAGTAATTTCTCGTTTCTTTGCACCCAAACCCAAAACCCAAGAAGATTCTTCCTTTCCCAACGATCCATCATCATCCTCATCTTTGGCGTCCCCCTCCCAATCACAACCACCGCCAACCCCACCTCCGAAAGTCTTGTCCACCGTCTCATTTTCACCCGCCAAACGCCTTAGGACTTCCCGGCTCCTCTCTCAATCTCCAGCATCATCTGGTCACACCCCACAATCCCTTTTCCCTAAACCCTCCAAAAAACCCAAACTTTCACCCCACACCCATAATCCTATACCCCCTCTCTCCAATCCTACTCTTCACGAAAAGTTTTTAAAGAAACTCCTGGAACCCTCTCAAGAGCTTTTAGAGACCTCCAAGAATCAGCCGATTGTGAATCCTAAGTACACCCCATTGGAGCAGCAAGTGGTGGAGCTCAAGGCCAAGTACCCCGATGTCCTGTTGATGGTCGAAGTCGGGTATAAATATAGGTTTTTTGGTGAAGATGCGGAGAATGCTGCAAGGATTTTGGGGATTTATGCTCATATGGATCATAATTTCTTGACTGCAAGTATACCCACTTTTCGGCTGAATGTCCACGTGCGGAGGCTTGTGAGTGCAGGGTACAAAGTTGGCGTGGTGAAACAGACTGAAACTGCAGCAATTAAGGCTCATGGGACCAATAAACTGGGACCCTTTTGCCGCGGATTATCAGCATTGTACACCAAGGCCACCTTGGAGGCTGCCGAAGATTTGGGAGGTGGTCAGGAGGGGTGTAGTTCATGTAATAATTATTTGGTTTGTGTTGTGGAGCAGGAGGTTGAGATTGTGAAGGGTGCCCTTGAGAGTGGGGTTGACGTGAAAATTGGCGTTATTGGAGTTGAAATTTCGACTGGGGATGTCTTGTATGGGGAGTTCAGTGATAATTTTTTGAGAAGTGGTCTGGAGTCTATGGTTTTAAACTTGTCTCCTGCTGAGTTACTTCTGGGGAAACCACTATCGAAGCAGACTGAGAAGTTGCTCCTAGCATATGCTGGACCGGCCTCAAATATCCGTGTTGAACATACCTCACGAGATTGCTTCACAGAAGGTGGTGCACTTGCTGAAGTGATGTCTCTGTTTGAGGGGATGACTGGAAATAAGCTAGGAGATTCCCATCACAAGGGAGATGTAGAGGCCAAAGAAAATGACAGCAATTGCTCTCCATTTGAGGGAATTATGGCACTACCTGATTTGGTAATCCAAGCATTAGGTCTAACCATTCGTCATCTCAAGCAATTTGGTCTTGAAGGAGTTCTCTGCTTGGAAGCTTCATTCCGGCCTCTATCTACCAAATTGGAGATGACCCTTACTGGCAATGCACTTCAACAACTGGAGGTTCTGAAGAATAATGCTGACGGTTCAGAGTCTGGCACCTTGCTGCAGTGTATGAATCATACTCTTACAATATTTGGTTCAAGGCTTCTTAGGCATTGGGTGGCTCATCCTTTATGCGATAGAAGCATGATATATGCTCGTCTTGATGCAGTTTCTGAGATTGTGGAATCTATGGGGGCCTTTAAAGCTTCTAGTAATTGTGAAAGTGACGGTGAAGAATCCGATATCATTACTATGCAGCCTGAAGTTCATGATATTCTTTCTTCGGTATTGACCTCTTTGGCTAGATCACCTGATATTCAACGTGGGATAACAAGAATATTCCATCGAACAGCCAAAGCAGCAGAGTTTATAGCTGTCATTCAAGCTATTCTACTTGCTGGGAAGCAGCTTCAGCAACTTCGTGGTCAGGAGGAGATGGAATACAAGAATTTGCAGACAACAGTTCACTCGCCCCTGCTGGTGAAGTTAATAATGGCAGCTTCATCATCAAGTATCCTTGGTACGGCTGCAAAACTATTGTCTGGGCTCAATAAAGAAGCTGCTGATCAAAAAGATCTTCACAATTTATTCATCATCTCTGATGGACAATTCCCAGAGGTTGCCGAAGCA
AGGCAAAAAGTTCAGTTGGCTAATGAGAACTTAGATTCGATGATTAGCACTTATCGTAAACAAGTGCAAGATCGCAGTTTGATGTTCACGAGTGTAGCTGGTATTACTCATTTGATAGAGTTGCCACTAACCGTGAAGGCGCCTCTAAATTGGTTGAAGGTAAATAGTACCAAAAAAACAATTCGCTATCACCCTCCTGAGGTTTTGATGGCTTTGGACCAGTTATCCTTGGCAAAAGAGGAGCTTACTCTTGTTTGCCAAGCTGCTTGGGAGGGTTTCTTGAAGGCCTTTGGTGGATATTATGCTGAGTTCCAAGAGGCTGTCCACGCTCTAGCTGCCTTGGATTGCCTGCATTCACTTTCCATTCTTTCAAGGAATAAGAATTATGTTCGTCCTGTTTTTGTCAATGATAATGAGCCAGTTCAGATACAGATATCTTCTGGCCGTCATCCTGTTATGGAGACCGTATTACAAGATAATTTTGTCCCAAATGATACAAATTTGCATGCTGAAGGAGAGTACTGTCAAATTATTACTGGACCAAACATGGGTGGAAAAAGCTGCTATATTCGCCAAGTTGCTCTGATTGCTATCATGGCTCAGGTCGGTTCCTTTGTACCAGCACTATCTGCAAAGCTGCATGTGGTAGATAGTATTTATACTCGAATAGGAGCTTCTGACAGTATTCAACGAGGAAGAAGTACCTTTTTGGAAGAACTGAGTGAAGCTTCTCTCATACTGCGGAATTGCACGACCCGCTCGCTGGTTATTATCGATGAGCTTGGCAGAGGGACAAGTACACATGATGGTGTAGCAATTGCCTATGCTACATTGCAATATCTTCTTGAGAATATAAGATGTATGGTCCTATTTGTCACCCATTACCCTAAGATAGCTGATATCAAGAATGAATTTCCAGACTCCGTGGCAGCATATCATGTTTCATATCTGACTTCGCAGAGAGATGATCAACTGGGTTTAGACTCTAACTTGACTGTGGATGGCATGAATCAAGAACATATCACTTACCTTTACAAACTTGTGCCTGGTGTTTCAGAAAGGAGCTTTGGCTTCAAGGTAGCTCAGCTTGCAGAGCTGCCATCGTCCTGTATTGAACGAGCCATTGAAATGGCTACAAGATTGGAAGCAGCAGTATGCAACAGAGAGAGAGAAAGGCTGGTGATGAAATGTGCCACAGAAAGTGAACTGAATTTAAGCGATAAAGCAAAGGCGAGAGAAGATGAAGAAAGAGAAGAGAGCATCTTGAATCCTGTTGATTCCTTGGGTACTGGAAAGATCGAAAGCTTAAGAGTATTTTGCGATGCTTGGAGGGAGTTCTTTCCATACTTGAACCTTGCAGTTTCAGGAGAAAGTGATGATGCAGAAAGGCTCCAAATTTTGAACCTTGCGAAGAGACTTGCACTTGAGTTGATAAACAGATGA |
Protein: MGKQKQQVISRFFAPKPKTQEDSSFPNDPSSSSSLASPSQSQPPPTPPPKVLSTVSFSPAKRLRTSRLLSQSPASSGHTPQSLFPKPSKKPKLSPHTHNPIPPLSNPTLHEKFLKKLLEPSQELLETSKNQPIVNPKYTPLEQQVVELKAKYPDVLLMVEVGYKYRFFGEDAENAARILGIYAHMDHNFLTASIPTFRLNVHVRRLVSAGYKVGVVKQTETAAIKAHGTNKLGPFCRGLSALYTKATLEAAEDLGGGQEGCSSCNNYLVCVVEQEVEIVKGALESGVDVKIGVIGVEISTGDVLYGEFSDNFLRSGLESMVLNLSPAELLLGKPLSKQTEKLLLAYAGPASNIRVEHTSRDCFTEGGALAEVMSLFEGMTGNKLGDSHHKGDVEAKENDSNCSPFEGIMALPDLVIQALGLTIRHLKQFGLEGVLCLEASFRPLSTKLEMTLTGNALQQLEVLKNNADGSESGTLLQCMNHTLTIFGSRLLRHWVAHPLCDRSMIYARLDAVSEIVESMGAFKASSNCESDGEESDIITMQPEVHDILSSVLTSLARSPDIQRGITRIFHRTAKAAEFIAVIQAILLAGKQLQQLRGQEEMEYKNLQTTVHSPLLVKLIMAASSSSILGTAAKLLSGLNKEAADQKDLHNLFIISDGQFPEVAEARQKVQLANENLDSMISTYRKQVQDRSLMFTSVAGITHLIELPLTVKAPLNWLKVNSTKKTIRYHPPEVLMALDQLSLAKEELTLVCQAAWEGFLKAFGGYYAEFQEAVHALAALDCLHSLSILSRNKNYVRPVFVNDNEPVQIQISSGRHPVMETVLQDNFVPNDTNLHAEGEYCQIITGPNMGGKSCYIRQVALIAIMAQVGSFVPALSAKLHVVDSIYTRIGASDSIQRGRSTFLEELSEASLILRNCTTRSLVIIDELGRGTSTHDGVAIAYATLQYLLENIRCMVLFVTHYPKIADIKNEFPDSVAAYHVSYLTSQRDDQLGLDSNLTVDGMNQEHITYLYKLVPGVSERSFGFKVAQLAELPSSCIERAIEMATRLEAAVCNRERERLVMKCATESELNLSDKAKAREDEEREESILNPVDSLGTGKIESLRVFCDAWREFFPYLNLAVSGESDDAERLQILNLAKRLALELINR |